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Appln. No.: 09/820,401 MATP-601US 
Amendment Dated September 21, 2005 
Reply to Office Action of June 21, 2005 

Amendments to the Claims: This listing of claims will replace all prior versions, and listinas 
of claims in the application ' y ' 

Listing of Claims: 

1. (Currently Amended) a method of displaying text information 

corresponding to a speech portion of audio signals of television program signals as a dosed 
caption on a video display device, the method comprising the steps of: 

dctcrmin i ng -i f - tho television signa l s I nc lu de cloccd caption info rfRetteftt 

tf5tag the closed- eaption i nformat i on if the television s i g na ls incfudc closed 



"ration ? 

decoding the audio signals of the television program; 

filtering the audio signals by using a spectral subtraction method to 
extract the speech portion; 

parsing the speech portion into discrete speech components in accordance 
with a speech model and grouping the parsed speech components employin g a fipp^pr 
ap pendant mod e Lto provide ph onemes as thejaarsed speech components wherein th*> 
speaker dependent model employs a hidden Markov m ort al ; 

receiving a training text as a part of the te le vision signal, the training text 
corresponding to a part of the speech portion of the audio signals; 

updating the hidden Markov moripl ha^ ri nn fho f ining text and rh^ part; 
oLthe^a eech portion pf_ the audio sig n a ls correspon di ng to the t rai ning text; 

aR£l Ying the u pjda ted hidde n_Ma rkov model to parse t h e speech portion of 
the audio signals to provide the phonemes; 
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identifying words in a database corresponding to the grouped speech 
components; and 

converting the identified words into text data for display on the display 
device as the closed caption. 

2. (Original) A method according to claim l, wherein the step of filtering 
the audio signals is performed concurrently with the step of decoding of later-occurring audio 
signals of the television program and step of parsing of earlier occurring speech signals of the 
television program. 

3. (Original) A method according to claim 1, wherein the step of parsing 
the speech portion into discrete speech components includes the step of employing a speaker 
independent model to provide individual words as the parsed speech components. 

4. (Original) A method according to claim 1 further including the step of 
formatting the text data into lines of text data for display in a closed caption area of the display 
device. 

5 -6 (Canceled) 

7. (Previously Presented) A method of displaying text information 
corresponding to a speech portion of audio signals of a television program as a closed caption 
on an video display device, the method comprising the steps of; 

decoding the audio signals of the television program; 

filtering the audio signals by using a spectral subtraction method to extract the 
speech portion; 

receiving a training text as a part of the television signal, the training text 
corresponding to a part of the speech portion of the audio signals; 
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generating a hidden Markov model from the training text and the part of the 
speech portion of the audio signals; 

x 

parsing the audio speech signals into phonemes based on the generated Hidden 
Markov model; 



identifying words in a database corresponding to grouped phonemes; and 

converting the identified words Into text data for presentation on the display of 
the audio-visual device as closed captioned textual data. 

8. (Original) A method according to claim 7, wherein the step of filtering 
the audio signals is performed concurrently with the step of decoding of later-occurring audio 
signals of the television program and step of parsing of earlier occurring speech signals of the 
television program. 



9. (Original) A method according to claim 7 further including the step of 
formatting the text data into lines of text data for display in a closed caption area of the display 
device* 



10. (Original) A method according to claim 7, further comprising the step 
of providing respective audio speech signals and training texts for each speaker of a plurality of 
speakers on the television program. 



11. (Currently Amended) Apparatus for displaying text information 
corresponding to a speech portion of audio signals of television program signals as a closed 
caption on an video display device, the method apparatus comprising: 

a-^resesse r w h i ch determines if the tcfcvisio n-frFog ram signols includ e cl o sed 
eapt4ere4F>fo rrriPtfon and enab l es the use of <r apttoR cd i nformation if t he-t-eJ ovision program 



si gnals include the dosed capt i on i nform a^on-ei 




module including; 
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a decoder which separates the audio signals from the television program 

signals; 

a spectral subtraction speech filter which identifies portions of the audio 
signals that include speech components and separates the identified speech component 
signals from the audio signals; 

a phoneme generator including a sp e aker-dependent speech reco gnition 
system which parses the speech portion into phonemes in accordance with a speech 
model which .Includ es a hidden Markov model r the phonem e ge nerator including: 

means for receiving a train ing t ext as a part of the televis ion 
signal, the training text corresponding to a part of the speech portion of the audio 
signals; 

me_ans_forjjpd atinq the hidden Markov mode l bas ed on the training 
text and the part of the speech portion of the audio sionals corresponding %g the 
training text; and 

means for applying the upd ated hidde n Markov model to pa rse the 
speech portion of the audio signals to proyip 'e jthe phonemes ; 

a database of words, each word being identified as corresponding to a 
discrete set of phonemes; 

a word matcher which groups the phonemes provided by the phoneme 
generator and identifies words in the database corresponding to the grouped phonemes; 
and 

a formatting processor that converts the identified words into text data for 
display on the display device as the closed caption. 

12, (Original) Apparatus according to claim ll f wherein the speech filter, 
the decoder and the phoneme generator are configured to operate in parallel. 
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13. (Original) Apparatus according to claim 11, wherein the phoneme 
generator includes a speaker independent speech recognition system. 

14 - 15 (Canceled). 

16. (Currently Amended) A computer readable carrier including 

computer program instructions that cause a computer to implement a method for displaying 
text information corresponding to a speech portion of audio signals of television program 
signals as a closed caption on an video display device, the method comprising the steps of: 

de termi ning if the te l ev i s i on signa l s inc l ude closed caption Information; 

ttstrwH he - closed capt i on i nformation i f the television s i gna l s inc l ude c l osed 
caption Information; ond 

i f tho telev i sion G i gna l s Hfe-^M^ftt^kH^e^lose 

decoding the audio signals of the television program; 

filtering the audio signals by using a spectral subtraction method to 
extract the speech portion; 

parsing the speech portion into discrete speech components in accordance 
with a speech model and grouping the parsed speech component s employing a speaker 
dependent model to, provid e phonemes as the parsed speech components wherein the 
speaker dependent model em ploys a hidd en Markov mo del ; 

receiving a training text as a part of the television signal, the training text: 
c orresponding to a part of the speech portion of the audio signals: 

updating the hidden Markov model based on the training te xt and the part 
of the speech portion of the audfo signals corresponding to the tr aining text: 
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identifying words in a database corresponding to the grouped speech 



components; and 



converting the identified words into text data for display on the display 



device as the closed caption, 

17. (Original) A computer readable carrier according to claim 16 r wherein 
the computer program instructions that cause the computer to perform the step of filtering the 
audio signals are configured to control the computer concurrently with the computer program 
instructions that cause the computer to perform the step of decoding the audio signals of the 
television program and with the computer program instructions that cause the computer to 
perform the step of parsing the speech signals of the television program. 



19. (Original) A computer readable carrier according to claim 16 further 
including computer program instructions that cause the computer to format the text data into 
lines of text data for display in a dosed caption area of the display device. 
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18. 



(Canceled) 



20. 



(Canceled) 
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